Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 11 |
| Number of observations | 2778702 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 233.2 MiB |
| Average record size in memory | 88.0 B |
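The memory figures in this table can be cross-checked with simple arithmetic. The sketch below assumes that each of the 11 columns stores one 8-byte value (or pointer) per row; that is an inference from the totals, not something the report states, and the variable names are illustrative.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 2_778_702
n_cols = 11
total_mib = 233.2

MIB = 1024 ** 2  # bytes per mebibyte

# Average record size = total bytes / number of rows.
avg_record_bytes = total_mib * MIB / n_rows

# Assumption: every column stores one 8-byte value (or pointer) per row.
assumed_bytes_per_cell = 8
per_column_mib = n_rows * assumed_bytes_per_cell / MIB

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"per-column size under the 8-byte assumption: {per_column_mib:.1f} MiB")
```

Under that assumption the per-column figure agrees with the 21.2 MiB memory size reported for each variable, and 11 columns of 8 bytes give the 88-byte record.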
Variable types
| Type | Count |
|---|---|
| Categorical | 1 |
| DateTime | 1 |
| Numeric | 9 |
Warnings
| Warning | Type |
|---|---|
| id_estacion has a high cardinality: 207 distinct values | High cardinality |
| tmax is highly correlated with tmin | High correlation |
| tmin is highly correlated with tmax | High correlation |
| longitud is highly correlated with latitud | High correlation |
| latitud is highly correlated with longitud | High correlation |
| tmax is highly correlated with tmin | High correlation |
| tmin is highly correlated with tmax | High correlation |
| tmax is highly correlated with tmin | High correlation |
| tmin is highly correlated with tmax | High correlation |
| tmin is highly correlated with fecha_cnt and 1 other field | High correlation |
| latitud is highly correlated with longitud and 1 other field | High correlation |
| fecha_cnt is highly correlated with tmin and 1 other field | High correlation |
| longitud is highly correlated with latitud and 1 other field | High correlation |
| altitud is highly correlated with latitud and 1 other field | High correlation |
| tmax is highly correlated with tmin and 1 other field | High correlation |
| nevada is highly skewed (γ1 = 589.8913408) | Skewed |
| prof_nieve is highly skewed (γ1 = 65.86614932) | Skewed |
| precip has 2066251 (74.4%) zeros | Zeros |
| nevada has 2778673 (> 99.9%) zeros | Zeros |
| prof_nieve has 2772465 (99.8%) zeros | Zeros |
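The percentages in the Zeros warnings are the zero count divided by the number of observations. A minimal sketch of that calculation follows; the helper name `zero_share` is hypothetical and the counts are copied from the warnings.

```python
n_rows = 2_778_702  # number of observations from the dataset statistics

# Zero counts copied from the Zeros warnings.
zero_counts = {"precip": 2_066_251, "nevada": 2_778_673, "prof_nieve": 2_772_465}


def zero_share(zeros: int, total: int) -> float:
    """Return the percentage of rows whose value is exactly zero."""
    return 100.0 * zeros / total


for name, zeros in zero_counts.items():
    share = zero_share(zeros, n_rows)
    print(f"{name}: {share:.1f}% zeros, {n_rows - zeros} non-zero rows")
```

Rounded to one decimal, nevada prints as 100.0%, which is presumably why the report shows "> 99.9%" instead. Only a handful of rows carry non-zero snowfall, which is consistent with the very large skewness reported for that variable.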
Reproduction
| Property | Value |
|---|---|
| Analysis started | 2021-10-09 13:00:35.267498 |
| Analysis finished | 2021-10-09 13:01:50.708049 |
| Duration | 1 minute and 15.44 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
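The reported duration follows from the two timestamps. The sketch below recomputes it with the standard library; the format string is an assumption that matches how the timestamps are printed in this table.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values printed in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-10-09 13:00:35.267498", FMT)
finished = datetime.strptime("2021-10-09 13:01:50.708049", FMT)

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```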
id_estacion
Categorical
| Statistic | Value |
|---|---|
| Distinct | 207 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.2 MiB |
| Value | Count |
|---|---|
| SP000009981 | 42185 |
| SP000008280 | 40383 |
| SP000003195 | 36896 |
| SPE00120629 | 36821 |
| SPE00155259 | 36704 |
| Other values (202) | 2585713 |
Length
| Statistic | Value |
|---|---|
| Max length | 11 |
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Statistic | Value |
|---|---|
| Total characters | 30565722 |
| Distinct characters | 15 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
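As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to classify the characters of one station ID taken from the sample rows. It covers only the general category (the standard library does not expose script or block), so it is a partial picture of what the report computes.

```python
import unicodedata
from collections import Counter

station_id = "SP000003195"  # sample value of the categorical variable

# General category of each character:
# "Lu" = uppercase letter, "Nd" = decimal number.
categories = Counter(unicodedata.category(ch) for ch in station_id)
print(dict(categories))

# Every ID is 11 characters long, so total characters = rows * 11.
n_rows = 2_778_702
total_characters = n_rows * len(station_id)
print(total_characters)
```

The product of the row count and the fixed length of 11 matches the total character count in the table.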
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Sample
| Row | Value |
|---|---|
| 1st row | SP000003195 |
| 2nd row | SP000003195 |
| 3rd row | SP000003195 |
| 4th row | SP000003195 |
| 5th row | SP000003195 |
Common Values
| Value | Count | Frequency (%) |
| SP000009981 | 42185 | 1.5% |
| SP000008280 | 40383 | 1.5% |
| SP000003195 | 36896 | 1.3% |
| SPE00120629 | 36821 | 1.3% |
| SPE00155259 | 36704 | 1.3% |
| SP000060010 | 36284 | 1.3% |
| SP000008027 | 34301 | 1.2% |
| SPE00120458 | 33080 | 1.2% |
| SPE00119711 | 33057 | 1.2% |
| SPE00120620 | 32466 | 1.2% |
| Other values (197) | 2416525 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| sp000009981 | 42185 | 1.5% |
| sp000008280 | 40383 | 1.5% |
| sp000003195 | 36896 | 1.3% |
| spe00120629 | 36821 | 1.3% |
| spe00155259 | 36704 | 1.3% |
| sp000060010 | 36284 | 1.3% |
| sp000008027 | 34301 | 1.2% |
| spe00120458 | 33080 | 1.2% |
| spe00119711 | 33057 | 1.2% |
| spe00120620 | 32466 | 1.2% |
| Other values (197) | 2416525 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9332042 | |
| 1 | 3867798 | |
| S | 2778702 | 9.1% |
| P | 2778702 | 9.1% |
| 2 | 2339578 | 7.7% |
| E | 2249350 | 7.4% |
| 5 | 1406537 | 4.6% |
| 9 | 1394106 | 4.6% |
| 6 | 1023402 | 3.3% |
| 8 | 967933 | 3.2% |
| Other values (5) | 2427572 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 22706150 | |
| Uppercase Letter | 7859572 | 25.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9332042 | |
| 1 | 3867798 | |
| 2 | 2339578 | 10.3% |
| 5 | 1406537 | 6.2% |
| 9 | 1394106 | 6.1% |
| 6 | 1023402 | 4.5% |
| 8 | 967933 | 4.3% |
| 3 | 919852 | 4.1% |
| 4 | 856691 | 3.8% |
| 7 | 598211 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2778702 | |
| P | 2778702 | |
| E | 2249350 | |
| W | 36816 | 0.5% |
| M | 16002 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22706150 | |
| Latin | 7859572 | 25.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9332042 | |
| 1 | 3867798 | |
| 2 | 2339578 | 10.3% |
| 5 | 1406537 | 6.2% |
| 9 | 1394106 | 6.1% |
| 6 | 1023402 | 4.5% |
| 8 | 967933 | 4.3% |
| 3 | 919852 | 4.1% |
| 4 | 856691 | 3.8% |
| 7 | 598211 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| S | 2778702 | |
| P | 2778702 | |
| E | 2249350 | |
| W | 36816 | 0.5% |
| M | 16002 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30565722 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9332042 | |
| 1 | 3867798 | |
| S | 2778702 | 9.1% |
| P | 2778702 | 9.1% |
| 2 | 2339578 | 7.7% |
| E | 2249350 | 7.4% |
| 5 | 1406537 | 4.6% |
| 9 | 1394106 | 4.6% |
| 6 | 1023402 | 3.3% |
| 8 | 967933 | 3.2% |
| Other values (5) | 2427572 | 7.9% |
fecha
Date
| Statistic | Value |
|---|---|
| Distinct | 44554 |
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.2 MiB |
| Minimum | 1896-11-01 00:00:00 |
| Maximum | 2021-08-10 00:00:00 |
Histogram with fixed size bins (bins=50)
fecha_cnt
Numeric
| Statistic | Value |
|---|---|
| Distinct | 366 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 183.4308911 |
| Minimum | 1 |
| Maximum | 366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 1 |
| 5-th percentile | 19 |
| Q1 | 93 |
| median | 184 |
| Q3 | 274 |
| 95-th percentile | 347 |
| Maximum | 366 |
| Range | 365 |
| Interquartile range (IQR) | 181 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 105.2283277 |
| Coefficient of variation (CV) | 0.573667429 |
| Kurtosis | -1.194650744 |
| Mean | 183.4308911 |
| Median Absolute Deviation (MAD) | 91 |
| Skewness | -0.0032271497 |
| Sum | 509699784 |
| Variance | 11073.00095 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 126 | 7734 | 0.3% |
| 122 | 7732 | 0.3% |
| 95 | 7725 | 0.3% |
| 124 | 7723 | 0.3% |
| 96 | 7722 | 0.3% |
| 93 | 7721 | 0.3% |
| 123 | 7719 | 0.3% |
| 121 | 7716 | 0.3% |
| 125 | 7713 | 0.3% |
| 92 | 7709 | 0.3% |
| Other values (356) | 2701488 |
| Value | Count | Frequency (%) |
| 1 | 7584 | |
| 2 | 7611 | |
| 3 | 7614 | |
| 4 | 7608 | |
| 5 | 7615 | |
| 6 | 7610 | |
| 7 | 7545 | |
| 8 | 7558 | |
| 9 | 7546 | |
| 10 | 7561 |
| Value | Count | Frequency (%) |
| 366 | 1979 | 0.1% |
| 365 | 7601 | |
| 364 | 7607 | |
| 363 | 7587 | |
| 362 | 7600 | |
| 361 | 7611 | |
| 360 | 7583 | |
| 359 | 7562 | |
| 358 | 7576 | |
| 357 | 7578 |
tmax
Numeric
| Statistic | Value |
|---|---|
| Distinct | 632 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.1744948 |
| Minimum | -196 |
| Maximum | 472 |
| Zeros | 1752 |
| Zeros (%) | 0.1% |
| Negative | 14996 |
| Negative (%) | 0.5% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | -196 |
| 5-th percentile | 74 |
| Q1 | 145 |
| median | 200 |
| Q3 | 257 |
| 95-th percentile | 330 |
| Maximum | 472 |
| Range | 668 |
| Interquartile range (IQR) | 112 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 78.57346962 |
| Coefficient of variation (CV) | 0.3925248804 |
| Kurtosis | -0.2814421038 |
| Mean | 200.1744948 |
| Median Absolute Deviation (MAD) | 56 |
| Skewness | -0.05182643485 |
| Sum | 556225269 |
| Variance | 6173.790128 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 200 | 32790 | 1.2% |
| 150 | 30517 | 1.1% |
| 170 | 30282 | 1.1% |
| 180 | 30107 | 1.1% |
| 160 | 29697 | 1.1% |
| 210 | 29465 | 1.1% |
| 220 | 29318 | 1.1% |
| 190 | 28687 | 1.0% |
| 230 | 27378 | 1.0% |
| 250 | 26688 | 1.0% |
| Other values (622) | 2483773 |
| Value | Count | Frequency (%) |
| -196 | 1 | |
| -191 | 1 | |
| -183 | 2 | |
| -181 | 1 | |
| -175 | 1 | |
| -174 | 1 | |
| -170 | 1 | |
| -167 | 1 | |
| -161 | 1 | |
| -160 | 1 |
| Value | Count | Frequency (%) |
| 472 | 1 | < 0.1% |
| 469 | 1 | < 0.1% |
| 466 | 3 | |
| 462 | 1 | < 0.1% |
| 461 | 1 | < 0.1% |
| 460 | 2 | < 0.1% |
| 459 | 1 | < 0.1% |
| 457 | 3 | |
| 456 | 5 | |
| 455 | 2 | < 0.1% |
tmin
Numeric
| Statistic | Value |
|---|---|
| Distinct | 543 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.76497408 |
| Minimum | -300 |
| Maximum | 332 |
| Zeros | 17473 |
| Zeros (%) | 0.6% |
| Negative | 217549 |
| Negative (%) | 7.8% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | -300 |
| 5-th percentile | -16 |
| Q1 | 50 |
| median | 100 |
| Q3 | 150 |
| 95-th percentile | 205 |
| Maximum | 332 |
| Range | 632 |
| Interquartile range (IQR) | 100 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 67.83950071 |
| Coefficient of variation (CV) | 0.68687813 |
| Kurtosis | -0.429977061 |
| Mean | 98.76497408 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | -0.2155019529 |
| Sum | 274438431 |
| Variance | 4602.197856 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 100 | 39078 | 1.4% |
| 150 | 35049 | 1.3% |
| 90 | 34794 | 1.3% |
| 80 | 34575 | 1.2% |
| 120 | 34322 | 1.2% |
| 110 | 33833 | 1.2% |
| 70 | 32829 | 1.2% |
| 140 | 32792 | 1.2% |
| 130 | 32746 | 1.2% |
| 60 | 30502 | 1.1% |
| Other values (533) | 2438182 |
| Value | Count | Frequency (%) |
| -300 | 1 | < 0.1% |
| -282 | 1 | < 0.1% |
| -280 | 1 | < 0.1% |
| -252 | 1 | < 0.1% |
| -248 | 1 | < 0.1% |
| -245 | 1 | < 0.1% |
| -240 | 3 | |
| -236 | 1 | < 0.1% |
| -232 | 1 | < 0.1% |
| -231 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 332 | 1 | |
| 322 | 1 | |
| 319 | 1 | |
| 318 | 1 | |
| 314 | 2 | |
| 310 | 1 | |
| 307 | 1 | |
| 306 | 2 | |
| 304 | 1 | |
| 302 | 1 |
precip
Numeric
| Statistic | Value |
|---|---|
| Distinct | 1383 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.43283195 |
| Minimum | 0 |
| Maximum | 3600 |
| Zeros | 2066251 |
| Zeros (%) | 74.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 100 |
| Maximum | 3600 |
| Range | 3600 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 59.10199543 |
| Coefficient of variation (CV) | 3.596580042 |
| Kurtosis | 148.4722969 |
| Mean | 16.43283195 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.536300929 |
| Sum | 45661943 |
| Variance | 3493.045863 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2066251 | |
| 1 | 41261 | 1.5% |
| 2 | 35913 | 1.3% |
| 3 | 27965 | 1.0% |
| 4 | 22168 | 0.8% |
| 5 | 19847 | 0.7% |
| 10 | 19741 | 0.7% |
| 6 | 17836 | 0.6% |
| 8 | 15445 | 0.6% |
| 7 | 13172 | 0.5% |
| Other values (1373) | 499103 | 18.0% |
| Value | Count | Frequency (%) |
| 0 | 2066251 | |
| 1 | 41261 | 1.5% |
| 2 | 35913 | 1.3% |
| 3 | 27965 | 1.0% |
| 4 | 22168 | 0.8% |
| 5 | 19847 | 0.7% |
| 6 | 17836 | 0.6% |
| 7 | 13172 | 0.5% |
| 8 | 15445 | 0.6% |
| 9 | 10024 | 0.4% |
| Value | Count | Frequency (%) |
| 3600 | 1 | |
| 3370 | 1 | |
| 3361 | 1 | |
| 3300 | 1 | |
| 3211 | 1 | |
| 3198 | 1 | |
| 3130 | 1 | |
| 2990 | 1 | |
| 2966 | 1 | |
| 2800 | 1 |
nevada
Numeric
| Statistic | Value |
|---|---|
| Distinct | 18 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0002929425322 |
| Minimum | 0 |
| Maximum | 119 |
| Zeros | 2778673 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 119 |
| Range | 119 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.1299940683 |
| Coefficient of variation (CV) | 443.7527979 |
| Kurtosis | 406874.9611 |
| Mean | 0.0002929425322 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 589.8913408 |
| Sum | 814 |
| Variance | 0.01689845778 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) |
| 0 | 2778673 | |
| 3 | 5 | < 0.1% |
| 5 | 4 | < 0.1% |
| 8 | 3 | < 0.1% |
| 51 | 2 | < 0.1% |
| 13 | 2 | < 0.1% |
| 28 | 2 | < 0.1% |
| 119 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| Other values (8) | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2778673 | |
| 3 | 5 | < 0.1% |
| 5 | 4 | < 0.1% |
| 8 | 3 | < 0.1% |
| 13 | 2 | < 0.1% |
| 15 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 28 | 2 | < 0.1% |
| 30 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 119 | 1 | |
| 79 | 1 | |
| 71 | 1 | |
| 69 | 1 | |
| 58 | 1 | |
| 51 | 2 | |
| 46 | 1 | |
| 43 | 1 | |
| 30 | 1 | |
| 28 | 2 |
prof_nieve
Numeric
| Statistic | Value |
|---|---|
| Distinct | 143 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4788465982 |
| Minimum | 0 |
| Maximum | 2499 |
| Zeros | 2772465 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2499 |
| Range | 2499 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 18.30344751 |
| Coefficient of variation (CV) | 38.22403162 |
| Kurtosis | 5792.795106 |
| Mean | 0.4788465982 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 65.86614932 |
| Sum | 1330572 |
| Variance | 335.0161907 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2772465 | |
| 10 | 1228 | < 0.1% |
| 20 | 537 | < 0.1% |
| 30 | 330 | < 0.1% |
| 51 | 318 | < 0.1% |
| 99 | 264 | < 0.1% |
| 41 | 259 | < 0.1% |
| 201 | 197 | < 0.1% |
| 150 | 187 | < 0.1% |
| 79 | 153 | < 0.1% |
| Other values (133) | 2764 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2772465 | |
| 10 | 1228 | < 0.1% |
| 20 | 537 | < 0.1% |
| 25 | 8 | < 0.1% |
| 30 | 330 | < 0.1% |
| 41 | 259 | < 0.1% |
| 51 | 318 | < 0.1% |
| 61 | 136 | < 0.1% |
| 71 | 148 | < 0.1% |
| 76 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2499 | 12 | |
| 2301 | 5 | < 0.1% |
| 2200 | 3 | < 0.1% |
| 2080 | 1 | < 0.1% |
| 1999 | 14 | |
| 1900 | 5 | < 0.1% |
| 1801 | 6 | < 0.1% |
| 1750 | 17 | |
| 1709 | 1 | < 0.1% |
| 1699 | 17 |
longitud
Numeric
| Statistic | Value |
|---|---|
| Distinct | 201 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.70648301 |
| Minimum | 27.8189 |
| Maximum | 43.5667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 27.8189 |
| 5-th percentile | 28.4775 |
| Q1 | 38.2828 |
| median | 40.8442 |
| Q3 | 42.0831 |
| 95-th percentile | 43.3669 |
| Maximum | 43.5667 |
| Range | 15.7478 |
| Interquartile range (IQR) | 3.8003 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 3.73806283 |
| Coefficient of variation (CV) | 0.09414238044 |
| Kurtosis | 3.245285269 |
| Mean | 39.70648301 |
| Median Absolute Deviation (MAD) | 1.5958 |
| Skewness | -1.86675048 |
| Sum | 110332483.8 |
| Variance | 13.97311372 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 40.8206 | 42185 | 1.5% |
| 38.9519 | 40383 | 1.5% |
| 40.4117 | 36896 | 1.3% |
| 41.1144 | 36821 | 1.3% |
| 41.4181 | 36704 | 1.3% |
| 28.3089 | 36284 | 1.3% |
| 38.9892 | 36155 | 1.3% |
| 43.3075 | 34301 | 1.2% |
| 28.4631 | 33080 | 1.2% |
| 43.3669 | 33057 | 1.2% |
| Other values (191) | 2412836 |
| Value | Count | Frequency (%) |
| 27.8189 | 17333 | |
| 27.9225 | 10262 | 0.4% |
| 28.0475 | 14855 | |
| 28.3089 | 36284 | |
| 28.4444 | 19576 | |
| 28.4631 | 33080 | |
| 28.4775 | 28684 | |
| 28.6331 | 19113 | |
| 28.9517 | 18366 | |
| 35.2778 | 21544 |
| Value | Count | Frequency (%) |
| 43.5667 | 19167 | |
| 43.5606 | 12347 | 0.4% |
| 43.5381 | 22792 | |
| 43.4917 | 16485 | |
| 43.4644 | 26669 | |
| 43.4292 | 21429 | |
| 43.3669 | 33057 | |
| 43.3606 | 21511 | |
| 43.3542 | 17733 | |
| 43.3075 | 34301 |
latitud
Numeric
| Statistic | Value |
|---|---|
| Distinct | 206 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.417536764 |
| Minimum | -17.8889 |
| Maximum | 4.2156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1933173 |
| Negative (%) | 69.6% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | -17.8889 |
| 5-th percentile | -16.2553 |
| Q1 | -5.6417 |
| median | -3.4503 |
| Q3 | 0.4914 |
| 95-th percentile | 2.3767 |
| Maximum | 4.2156 |
| Range | 22.1045 |
| Interquartile range (IQR) | 6.1331 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 4.689366735 |
| Coefficient of variation (CV) | -1.372148146 |
| Kurtosis | 1.542218244 |
| Mean | -3.417536764 |
| Median Absolute Deviation (MAD) | 2.6053 |
| Skewness | -1.171888945 |
| Sum | -9496316.242 |
| Variance | 21.99016038 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -3.7892 | 45659 | 1.6% |
| 0.4914 | 42185 | 1.5% |
| -1.8631 | 40383 | 1.5% |
| -3.6781 | 36896 | 1.3% |
| -1.4106 | 36821 | 1.3% |
| 2.1239 | 36704 | 1.3% |
| -16.4992 | 36284 | 1.3% |
| -2.0392 | 34301 | 1.2% |
| -16.2553 | 33080 | 1.2% |
| -8.4192 | 33057 | 1.2% |
| Other values (196) | 2403332 |
| Value | Count | Frequency (%) |
| -17.8889 | 17333 | |
| -17.755 | 19113 | |
| -16.5606 | 14855 | |
| -16.4992 | 36284 | |
| -16.3292 | 28684 | |
| -16.2553 | 33080 | |
| -15.3892 | 10262 | 0.4% |
| -13.8631 | 19576 | |
| -13.6003 | 18366 | |
| -8.6494 | 9706 | 0.3% |
| Value | Count | Frequency (%) |
| 4.2156 | 19696 | |
| 3.1817 | 4598 | 0.2% |
| 3.1658 | 4603 | 0.2% |
| 3.0967 | 4604 | 0.2% |
| 3.0353 | 4601 | 0.2% |
| 3.0325 | 4601 | 0.2% |
| 2.8342 | 1681 | 0.1% |
| 2.8267 | 4003 | 0.1% |
| 2.8253 | 21143 | |
| 2.8067 | 3615 | 0.1% |
altitud
Numeric
| Statistic | Value |
|---|---|
| Distinct | 173 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 425.0443864 |
| Minimum | 1 |
| Maximum | 2535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.2 MiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 1 |
| 5-th percentile | 4 |
| Q1 | 42 |
| median | 251 |
| Q3 | 667 |
| 95-th percentile | 1143 |
| Maximum | 2535 |
| Range | 2534 |
| Interquartile range (IQR) | 625 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 506.8215407 |
| Coefficient of variation (CV) | 1.19239674 |
| Kurtosis | 4.529438433 |
| Mean | 425.0443864 |
| Median Absolute Deviation (MAD) | 237 |
| Skewness | 1.941035627 |
| Sum | 1181071687 |
| Variance | 256868.0741 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4 | 82977 | 3.0% |
| 1 | 59206 | 2.1% |
| 35 | 50120 | 1.8% |
| 32 | 48181 | 1.7% |
| 44 | 42185 | 1.5% |
| 64 | 41524 | 1.5% |
| 704 | 40383 | 1.5% |
| 7 | 39402 | 1.4% |
| 25 | 38990 | 1.4% |
| 5 | 37975 | 1.4% |
| Other values (163) | 2297759 |
| Value | Count | Frequency (%) |
| 1 | 59206 | |
| 2 | 9206 | 0.3% |
| 3 | 22163 | 0.8% |
| 4 | 82977 | |
| 5 | 37975 | |
| 6 | 19208 | 0.7% |
| 7 | 39402 | |
| 8 | 3427 | 0.1% |
| 11 | 31219 | 1.1% |
| 14 | 22967 | 0.8% |
| Value | Count | Frequency (%) |
| 2535 | 4573 | 0.2% |
| 2519 | 4565 | 0.2% |
| 2451 | 4739 | 0.2% |
| 2400 | 4503 | 0.2% |
| 2371 | 36284 | |
| 2316 | 4593 | 0.2% |
| 2266 | 4603 | 0.2% |
| 2247 | 4444 | 0.2% |
| 2230 | 4601 | 0.2% |
| 2228 | 4579 | 0.2% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
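The definition can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the profiling tool runs; the function name `pearson_r` is hypothetical, and the inputs are the tmax and tmin values from the report's first ten rows.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # The 1/n factors of the covariance and the variances cancel out.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# tmax and tmin from the first ten rows (station SP000003195).
tmax = [119.0, 110.0, 86.0, 68.0, 66.0, 58.0, 69.0, 62.0, 122.0, 97.0]
tmin = [88.0, 26.0, 24.0, 25.0, 16.0, -15.0, 8.0, -16.0, -12.0, 9.0]

r = pearson_r(tmax, tmin)
# Shifting and rescaling one variable (positive scale) leaves r unchanged.
r_rescaled = pearson_r([2 * t + 5 for t in tmax], tmin)
print(round(r, 3), round(r_rescaled, 3))
```

Ten rows from one station are far too few to say anything about the full dataset; the point is only that the two printed values coincide, which demonstrates the location and scale invariance described above.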
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
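A minimal sketch of that recipe: rank both variables, then apply the Pearson formula to the ranks. The helper names are hypothetical, and assigning tied values the mean of their positions is a common convention assumed here rather than something the report specifies.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x
print(spearman_rho(x, y), round(pearson_r(x, y), 3))
```

For this monotonic but nonlinear relationship the rank correlation is perfect while Pearson's r falls short of 1, which is the difference the paragraph above describes.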
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
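The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition; library implementations often use the tie-corrected tau-b variant instead, and which one produced the figures in this report is not stated, so treat the function (a hypothetical `kendall_tau_a`) as an illustration only.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs; tied pairs count as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Six pairs in total: five concordant, one discordant (the swapped 3 and 2).
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

The quadratic loop over all pairs is fine for a demonstration but would be far too slow for the 2.8 million rows profiled here.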
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | SP000003195 | 1920-01-01 | 1 | 119.0 | 88.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 1 | SP000003195 | 1920-01-02 | 2 | 110.0 | 26.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 2 | SP000003195 | 1920-01-03 | 3 | 86.0 | 24.0 | 14.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 3 | SP000003195 | 1920-01-04 | 4 | 68.0 | 25.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 4 | SP000003195 | 1920-01-05 | 5 | 66.0 | 16.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 5 | SP000003195 | 1920-01-06 | 6 | 58.0 | -15.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 6 | SP000003195 | 1920-01-07 | 7 | 69.0 | 8.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 7 | SP000003195 | 1920-01-08 | 8 | 62.0 | -16.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 8 | SP000003195 | 1920-01-09 | 9 | 122.0 | -12.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 9 | SP000003195 | 1920-01-10 | 10 | 97.0 | 9.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
Last rows
| | id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2778692 | SPW00014011 | 1967-12-22 | 356 | 61.0 | -11.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778693 | SPW00014011 | 1967-12-23 | 357 | 33.0 | 0.0 | 3.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778694 | SPW00014011 | 1967-12-24 | 358 | 56.0 | 17.0 | 10.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778695 | SPW00014011 | 1967-12-25 | 359 | 89.0 | -22.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778696 | SPW00014011 | 1967-12-26 | 360 | 100.0 | 44.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778697 | SPW00014011 | 1967-12-27 | 361 | 94.0 | 0.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778698 | SPW00014011 | 1967-12-28 | 362 | 94.0 | -28.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778699 | SPW00014011 | 1967-12-29 | 363 | 72.0 | -33.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778700 | SPW00014011 | 1967-12-30 | 364 | 61.0 | -33.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 2778701 | SPW00014011 | 1967-12-31 | 365 | 67.0 | -61.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |